Spatial asset management system and method

ABSTRACT

A data collection and automatic database population system which combines global positioning system (GPS), speech recognition software, radio frequency (RF) communications, and geographic information system (GIS) to allow rapid capture of field data, asset tracking, and automatic transfer of the data to a GIS database. A pre-defined grammar allows observations to be continuously captured along with GPS location and time, and stored on the field mobile unit. A mobile unit&#39;s location is tracked in real time or post processed through wireless RF transmission of location information between the mobile unit and a central processing station. The captured data is electronically transferred to a central processing station for quality assurance and automatic population of the GIS database. The system provides for automatic correlation of field data with other GIS database layers. Tools to generate predefined or user defined reports, work orders, and general data queries allow exploitation of the GIS database.

CROSS REFERENCE TO RELATED APPLICATION

This application is a divisional application of application Ser. No. 08/714,583, filed on Sep. 16, 1996 now U.S. Pat. No. 6,272,457, which is hereby incorporated herein by reference.

FIELD OF THE INVENTION

This invention relates to methods for combining Global Positioning System (“GPS”), Speech Recognition, Radio Frequency (“RF”), and Geographic Information System (“GIS”) to perform mobile field data collection and automatic population of a GIS database with fully attributed and correlated observation data. The system relates particularly to a field data capture system and automatic GIS database population tool for a user to build GIS layers and fully exploit the data in the GIS.

BACKGROUND OF THE INVENTION

Organizations responsible for the maintenance and inventory of assets are turning to GIS as the tool of choice to manage and display these assets. Over eighty percent of the cost of a GIS is capturing and placing accurate, fully attributed data into the GIS. These costs have prohibited many users from either implementing or fully exploiting the GIS.

A number of different methods have been developed for capturing data in the field. Many users use the data collection method of traveling an inspection route, visually identifying the location, and hand writing a description onto a form or a paper entry. Once the inspector returns to the central data repository the entries so collected are manually entered into a database with questionable accuracy and time consuming labor. The user must build the correlation and association logic into the database to create a useful tool. Back end applications must also be created so that the information is useful to the user. More sophisticated methods include GPS with push button data collection or pen computer data entry units which allow predefined buttons and menus to be used for field data collection. The data can be electronically downloaded into a database, but a user must still build the correlation and association logic. The information downloaded is limited to point information with limited attribute information.

Audio based data entry systems have been developed but are limited to the recording of street point information sequenced with a manually recorded location input. The user is then required to manually convert, transfer, and combine the location data with the audio data. There is no processing of the audio data and manual transcription, and tagging of the entries with location data must be manually performed by the user. Only location data where a observation has been recorded is stored, and all other location information is ignored. Other speech recognition systems require the user to prerecord their speech to replace keyboard entries. None of the described systems provide the automatic population of the GIS with fully attributed and correlated data generated from speech recognition.

As users of spatial data incorporate GIS and GPS based technology, the need for a flexible, true end to end system that collects field data, populates a GIS, tracks field assets, and provides tools to exploit the data will increase.

SUMMARY OF THE INVENTION

It is an object of the present invention to provide a method and system for a speech recognition based field data capture system, asset tracking, and automatic GIS database population tool for a user to build GIS layers, to track assets, and to fully exploit the data in the GIS.

It is an object of the present invention to combine GPS, Speech Recognition, and GIS, and to provide field data collection, automatic GIS database population, and exploitation of the GIS data.

It is an object of the present invention to provide the real time tracking of assets in the field through the combination of GPS and RF communications.

In furtherance of these objects, a field mobile unit capable of continuously capturing feature observations from predefined grammar and free speech, as well as GPS based location information time-stamped and automatically stored on the units onboard memory, is created. Location information is automatically corrected in the field using Differential Global Positioning Service (“DGPS”) and RF wireless data transmission. The location information is automatically combined with observation information to provide a continuous record of locations and observations.

The preferred mobile field unit device is mounted in a vehicle or backpack. The audio headset microphone provides the means for initiating a speech-based description of user observations. The mobile unit computer provides the onboard data storage of speech observations and the GPS time-stamped location signal. The unit provides the ability to electronically transfer field data. The unit provides an audio feedback to the user to optimize speech entry start and stop, as well as notify the user of loss of GPS signal. The grammar structure provides self editing tools as well as a comprehensive description of field observations.

In the preferred form of the invention the location and observation information is transferred electronically to the central data repository or via RF wireless media. The audio data process automatically converts the audio data collected in the field using the semantic information in the reference grammar and creates data records representing the information content of the user's verbal observations. The user can validate and correct observation statements. Interactive tools allow the user to review all speech entries and correct them as required. The results are user validated and grammatically valid.

The preferred form of the invention automatically merges the corrected location data and the recognized text data and precisely synchronizes the verbal data to a location, as well as identifying any continuous span of tracks covered by an observation. The data is then automatically entered into the GIS database and correlated to linear networks and point observations within the central data repository.

The preferred form of the invention provides predefined or customer configurable tools to exploit the data in the central data repository. Work orders, custom reports, and data query scripts are created using these tools.

The vehicle location information is derived from GPS which provides a time-stamp from which absolute location coordinates may be determined through interpolation of recorded GPS data points.

Methods and apparatus which incorporate the features described above and which are effective to function as described above comprise specific objects of the present invention.

Other and further objects of the present invention will be apparent from the following description and claims and are illustrated in the accompanying drawings, which by way of illustration, show preferred embodiments of the present invention and the principles thereof and what are now considered to be the best modes contemplated for applying these principles.

BRIEF DESCRIPTION OF THE DRAWING VIEWS

FIG. 1 is a diagrammatic view of a spatial asset management system constructed in accordance with one embodiment of the present invention. FIG. 1 shows the processes, the data elements used in the processing, and the user interaction with the system. FIG. 1 is a high level overview of the system.

FIG. 2 is a diagrammatic view showing the details of the 1.0 Data Conversion process of FIG. 1. FIG. 2 shows the 1.0 Data Conversion processing in conjunction with the collected data elements and the reference data elements and the user interaction. FIG. 2 shows both the Audio Data 1.A and GPS Data 1.B going through their appropriate processing paths and being merged into an Observation 1.G. FIG. 2 also shows, in the component labeled Track 1.F, the historical representation of where the field operator had been and when the field operator had been there. The Observation 1.G and the Track 1.F are two key outputs of the 1.0 Data Conversion process shown in FIG. 2. Semantic analysis is performed in the 1.6 Interpret Text process and by use of the Reference Observation Semantics 1.E to create the Observation 1.G.

FIG. 3 is a diagrammatic view showing details of the in the 2.0 Data Correlation process of FIG. 1. FIG. 3 shows the two main data inputs (the Track 1.F and the Observation 1.G) coming from the 1.0 Data Conversion process shown in FIG. 2. FIG. 3 shows that Track 1.F is first correlated to the Reference Network 1.K. FIG. 3 also shows that the input information Track 1.F and Observation 1.G are correlated to the Reference Network 1.K and to the appropriate other layers of the GIS creating a Tour 1.L object. The Tour 1.L object comprises: who collected the data; what data was collected; where the field operator was; what the field operator was doing; when the field operator was collecting the data; and the correlation results.

FIG. 4 is a diagrammatic view showing the 3.0 Repository Update process as updated with the Tour 1.L results. FIG. 4 also shows, the 3.3 Define Repository process and the 3.5 Configure Tour process, the definition of the repository structure. FIG. 5 is a pictorial view, in plan, showing an example of data collection in the field. FIG. 5 shows a vehicle traveling north on Elm Street. FIG. 5 shows the position of the vehicle by its GPS points and shows two observation events indicated by the numerals 1 and 2. The data input from the observation events is voice data, indicated by the quotations in FIG. 5.

FIG. 6 shows the processing sequence for data conversion for the two specific observation events identified in FIG. 5. FIG. 6 also shows the semantic analysis of associating observation event 2 to observation event 1. The results of the semantic analyses are indicated by the inclined block arrow in the lower part of FIG. 6.

FIG. 7 is a diagrammatic view illustrating the four primary types of data maintained within the Repository 1.M of the system shown in FIG. 1. In FIG. 7 the arrows indicate the data structure relationships. As illustrated in FIG. 7, Assets can always be associated with other Assets, Condition must be associated with an Asset, Defect must be associated with an Asset, and Repair can be associated only with a Defect. FIG. 7 also shows the structure for each of the primary data types. The processing information portion of the structure of each primary observation type is embodied in the association (indicated by the arrows), the Spatial Type information, and the Storage Layer and Associated Layers information. Each of the primary observation types also have Location and Attributes in its structure.

FIG. 8 requires too much illustration area to be capable of being shown on one sheet of drawing and is therefore composed of FIG. 8A (on one sheet of drawings) and FIG. 8B (on the succeeding sheet of drawings). FIG. 8 is an example grammar of the type used in FIGS. 5 and 6 but for a specific asphalt distress observation type. Each of the boxes shown in FIG. 8 represent different sentence types. The two observation events illustrated in FIG. 5 correspond to the respective top box and bottom box in FIG. 8. The semantic information identifying that the second sentence is a modifier of the first sentence is illustrated by the two dashed lines in FIG. 8—the first dashed line going from “Tag:blob” up to the term “blob” and the second dashed line going from “Tag:area” up to the term “area” in the Observation Template. The observation statements in FIG. 5 correspond to the Recognized Text 2.A in FIG. 2, and the Reference Observation Semantics 1.E of FIG. 2 correspond to the information contained in the Asphalt Project Grammar of FIG. 8.

FIG. 9 is an illustration of the 2.0 Data Correlation process using the example illustrated in FIG. 5 and continuing the example shown in FIG. 6. FIG. 5 shows data collection. FIG. 6 shows data conversion. FIG. 9 shows data correlation. FIG. 9 shows how an observation in track data is correlated to an asset (note the results of the correlation show that the Defect is correlated to the street segment on Elm Street between First Street and Second Street). FIG. 9 also illustrates the process of moving data into the appropriate GIS layers in the spatial asset management system of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

FIG. 1 presents an overview of a preferred form of the spatial asset management system and method. Subsequent FIGS. 2-4 expand each major process shown in FIG. 1. For example, the process 1.0 Data Conversion (the top circle in FIG. 1) is expanded into a more detailed flow chart in FIG. 2.

The spatial asset management system and method described herein is a hardware independent system solution for managing assets with a strong geospatial component. The preferred form of the system is implemented in a commercial off-the-shelf laptop or pen-based computer for the mobile system component and a high performance PC for the processing workstation home base computer.

The three data stores Audio Data 1.A, GPS Data 1.B, and Sensor Data 1.C shown in FIG. 1 are generated in the mobile system laptop computer. All subsequent processes and data stores are maintained in the home base computer or workstation.

The system provides for a seamless, fully automatic capture, translation, GIS import, and analysis/processing of asset information as represented by the Audio Data 1.A, GPS Data 1.B, and Sensor Data 1.C stores developed during collection. The mobile unit computer may be hand carried (e.g., backpack) or mounted on a moving vehicle (e.g., car, truck, bicycle).

FIG. 5 illustrates the collection of data whereby the user can drive, or walk along, an inspection route and can comment on observed defects, assets, asset condition or other observations. Also shown in FIG. 5 are the GPS points that are collected by the system.

FIG. 6 shows how the observations in FIG. 5 are processed functionally by the system to become data items that are correlated against existing asset information and analyzed for corrective action by operations personnel.

The mobile unit computer is configured with a commercial GPS receiver (or other location receiver device), a standard commercial sound board, and standard I/O devices (e.g., printer, disk drive, RS-232 ports) along with a battery or external power source. Other sensor inputs include such sensors as digital cameras, laser ranging devices, and others. For example, digital camera sensor input allows for photos to be included of city code violations. In this case the digital photo image is automatically tagged and tracked by the system so that photo evidence is included directly in the violation report sent to the offender.

Voice observations are automatically correlated with the sensor inputs to be incorporated as an associated data record. The mobile unit computer captures and time-stamps all data store records. Each data store is independent and no other synchronized signal or input is required other than standard precision time. The Audio Data 1.A store contains all speech audio data detected by the mobile system sound card. The GPS Data 1.B store includes location derived information containing latitude, longitude, and altitude of the mobile unit on a continuous basis once the unit is initialized. The Sensor Data 1.C store contains any external sensor records, such as switch on/off states, analog values, digital photos, laser ranging data, etc.

As will be described in more detail with reference to FIG. 2, the 1.0 Data Conversion process means receives the mobile unit data from Audio Data 1.A, GPS Data 1.B, and Sensor Data 1.C data stores described above. The 1.0 Data Conversion process operates on these inputs in conjunction with reference data (Reference Grammar 1.D, Reference Observation Semantics 1.E, and Reference DGPS Data 1.H) to produce Track 1.F objects and Observation 1.G objects data stores. The functions supported by the 1.0 Data Conversion process are: (1) automatic interpretation of audio data spoken words using a reference dictionary contained within the Reference Grammar 1.D data store, (2) automatic detection of word level interpretation error conditions, (3) automatic interpretation of phrases using pre-defined meaning and phrase syntax contained within the Reference Observation Semantics 1.E data stores, (4) automatic detection of semantic error conditions, (5) optional correction of GPS location data using Reference DGPS Data 1.H, (6) automatic generation of time based location Track 1.F data objects in internal system format, (7) automatic generation of time-based Observation 1.G data objects and internal system format and (8) operator use of interactive displays to perform Quality Assurance (QA) functions against either Audio Data 1.A or Sensor Data 1.C stores.

The net result of the 1.0 Data Conversion process is a data store of error corrected track information which is an automated time-sequenced track of the mobile unit's historical travel path with precise latitude, longitude and altitude for a given “Tour” (note that Tours are actually generated by the 2.0 Data Correlation process).

Another result of 1.0 Data Conversion process is a time-sequenced and operator quality assurance checked set of observation objects, which represent either “discrete” observations (e.g., “tree, foliage damage,” “stop sign, class 1 damage,” “pothole, high, right”), “linear” observations (e.g., “start curb and gutter run,” “end curb and gutter run,” “start road width 32,” “end road”) or “polygon” definitions which is a unique form of track data store. These Track 1.F and Observation 1.G data stores are available to the 2.0 Data Correlation process.

FIG. 6 illustrates the buildup of these data types. The system organizes data into a logical contiguous set of collected data that may last from a few minutes to several hours. A street inspection tour, for example, would typically consist of the collection of street distress data for several hours before concluding the collection and submitting the collected data to the home base workstation for processing.

The “discrete” observations captured include any and all assets which are best categorized as an item or set of items at discrete locations. Examples of types of objects collected are signage, lights, street distresses, concrete distresses, park benches, tree damage, utility cuts, utility access covers, fire plugs, incidences of code violations (e.g., weeds, illegal cars parked, damaged fence, etc.), curb damage, sidewalk distresses, and other items of the like. Usually discrete object observations are accompanied by a status, state, or condition which are related to the object and position, a size, or other descriptive term that may help identify or qualify the observation. The phrase “pothole, medium, right,” would be translated by the 1.0 Data Conversion process to mean:

“pothole”=pothole type of road distress;

“medium”=distress severity medium;

“right”=the right lane (assuming more than one lane in the current direction of travel).

Similarly “linear” observations are used for assets or objects that are running or continuous in nature for some significant length. Examples are roads, sidewalks, curbs, gutters, fences, paint stripping, property frontage, and others of the like. Linear objects are usually accompanied by state or condition, plus an indication that the running asset starts or stops at some position.

An example might be when an inspector is monitoring the condition of road centerline paint conditions. A phrase may be “start road centerline paint condition 3” which would mean that the inspector is reporting the beginning of a class 3 (e.g., badly worn) status of road stripping condition. This condition may last for several miles. When the condition changes the inspection would terminate the running asset condition with a phrase such as “end road centerline condition 3.”

The system interprets and keeps track of all running asset states. In addition the inspector may continue commenting on any other objects or observations while the linear conditions are being tracked. That is to say that the inspection can start a running asset observation (like the road paint stripping), then report on several defects (such as sign damage), and then terminate the running asset conditions. The system automatically keeps track of all such interleaved conditions. Logic errors are automatically detected and identified to the operator during the Quality Assurance processing with the 1.0 Data Conversion process.

Another observation data type is “polygonal.” Polygonal data is usually associated with defining areas or boundaries. Using a backpack mounted system, a parks inspector might, for example, walk and define the boundaries of an area of a park, perform a tree or endangered species inventory or forest damage by some infestation. The results would be a polygon that describes the area where the observations are located.

As described in more detail below, the 2.0 Data Correlation process means operates on the Track 1.F and Observation 1.G data stores which are output by the 1.0 Data Conversion process means to perform correlation against a variety of reference data. The 2.0 Data Correlation process organizes and associates Track 1.F data stores with Observation 1.G data stores which are output to produce logical “tours,” which are sets of data (collected by the user) such as those discussed earlier.

The 2.0 Data Correlation process automatically routes data items to the proper layer of the GIS database for further processing. For example, signage would be associated with a specific layer of GIS whereas street distresses would be associated with a separate layer. The 2.0 Data Correlation process uses the Reference Asset 1.J data store to correlate the collected discrete asset observation tour data to the existing database of objects (e.g., signs, park benches, etc.) of the same category or class.

The system automatically detects inconsistencies between the collected and reference asset data and brings problems to the attention of the field operator. These inconsistencies can be corrected or edited using Quality Assurance tools provided. Ultimately the reference asset database is updated for future reference.

Similarly, observation tour data which represents discrete defects, (e.g., road potholes, fence damage, curb upheaval, etc.) are correlated and compared against the Reference Defect 1.I data store and are quality assured for consistency and logical error state by the 2.0 Data Correlation process. The 2.0 Data Correlation process also performs the same type of functions for linear observations tour data, such as curbing and sidewalk networks, using the Reference Network 1.K data store. A set of Edit and Quality Assurance tools are provided to support the correlation processing of network type data.

Reference Network 1.K data stores include simple tour location Track 1.F data as well (which allows the system to capture and compare location track data independent of collected discrete, or linear objects). This enables the system to identify which inspectors have inspected which streets and when. It also allows a broad range of tour analysis functions to be accomplished, such as, which areas have streets that have not been inspected for the last three months.

The general functionality supported by the 2.0 Data Correlation process are (1) automatic association of collected data to proper GIS layers, (2) automatic detection of inconsistencies between collected observations and reference data, (3) correction of conflicted data, (4) analysis of tour location track information such as routes traveled with temporal reference, (5) quality assurance of correlated data, and (6) the organization and association of Track 1.F and Observation 1.G into “tours” which are correlated location, observation, and time data sets.

The 3.0 Repository Update process means provide all of the tools to create, update, and generally manage the system reference databases. A primary input to this process is the Tour 1.L data store which is generated by the 2.0 Data Correlation process. The 3.0 Repository Update process provides the tools to create new assets and/or conditions the system will recognize by updating the Reference Grammar 1.D data store and the Reference Observation Semantics 1.E data store along with the appropriate Reference Asset 1.J, Reference Defect 1.I, or Reference Network 1.K data stores. Using this function allows the user to add new types of defects (such as a new type of damage or new class of utility cut in the road), add new asset types, add new tour types (such as utility inspection tours), and any other operational data elements needed.

Data management tools include editing, data consistency checking, data integrity and version control, and backup tools. Operational data store elements are maintained in the Repository 1.M database. The Repository 1.M data store is where the results of system processing are placed.

Using a variety of GIS configured, third party, and Spatial Asset System tools, the field operator/user can gain access to the operational database for analysis and reporting purposes. The analysis and reporting tools include both ad-hoc and predefined analysis and reporting capabilities. They range from such capabilities as visual analysis and interrogation of GIS layers to specific reports on such elements as road defect history in a given neighborhood.

The user can query and generate reports on any and all data contained within the Repository 1.M data stores. Using these tools the user can ask such questions as:

-   -   How many of a specific asset type is located within center         boundaries?     -   What are the specific work orders (time to accomplish, etc.) to         repair specified road segments?     -   Show the inspection routes covered by a specified inspector over         a given period of time.     -   Show all road signs that are severely damaged and what is an         optimal route for repair.

FIG. 2 is a detailed diagrammatic view of the 1.0 Data Conversion process of FIG. 1. From the field collection process the results of the operator's verbal inputs are represented by the data store labeled Audio Data 1.A. These are time-stamped digital audio data segments corresponding to each verbal phrase spoken by the field operator.

The data store identified by the label GPS Data 1.B represents all of the GPS data collected in the field during the operator's trip. The Reference DGPS Data 1.H store is the DGPS correction data collected during the operator's trip.

The 1.1 Correct Location Bias process applies the correction data to the GPS data, if it was not corrected in the field using real-time DGPS. Note that in the preferred implementation the field GPS units can be used in either real-time DGPS mode or post-processing DGPS mode, depending upon the needs of the field operator.

The results of the 1.1 Correct Location Bias process is DGPS corrected location data that is then stored in the Corrected Location 2.B data store. The corrected data is then processed, by 1.2 Vectorize Location Data, to convert the individual point data, (typically collected at 1 second intervals, but any interval period is possible), into track data which is stored in Track 1.F. The purpose of this processing is to compress the point data into a higher order representation of linear and are based tracks. This compression greatly improves the performance of latter processing illustrated in FIG. 3.

The 1.3 Recognize Audio Data process automatically converts the Audio Data 1.A collected in the field using the semantic information in Reference Grammar 1.D, and creates intermediate data records (Recognized Text 2.A) representing textually/linguistically the information content of the operator's verbal statements made in the field. Note that the field unit can record the audio data in either of two ways. First, it can recognize when voice is present and only record when the operator is speaking, which is the preferred approach. Or the field unit can record all data regardless of whether the operator is speaking.

In the latter case, the 1.3 Recognized Audio Data process will break the continuous audio data into the individual spoken phrases using the same approach as the field unit would use, i.e., energy threshold of the audio data. The user then can validate and correct any problems with the results through the 1.4 Verify Speech Recognition process. With the interactive tools provided in this process the user can review all of the automatic recognition processing and fix any problems encountered.

The Reference Grammar 1.D information is used to maintain the integrity of the resulting fixes. The Track 1.F information is used to provide visual location information to the operator on where they were at the time they made the verbal statement. The results from 1.4 Verify Speech Recognition processing are stored into Recognized Text 2.A. These results are both user validated and grammatically valid.

The 1.5 Assign Location process automatically merges the Track 1.F data and the Recognized Text 2.A data, precisely synchronizing the verbal data to the location data and identifying any contiguous span of tracks covered by an observation. The resulting merged data is forwarded to the 1.6 Interpret Text process. This process uses the Reference Observation Semantic 1.E information to merge the sequence of recognized text into actual Observation 1.G.

It should be noted that the system can take a non-contiguous set of verbal statements and combine them into a single observation. An example of this process is discussed latter, relative to FIG. 8.

The 1.6 Interpret Text process performs the semantic analysis on the sequence of recognized text to determine if it is complete and consistent.

FIG. 4 is the diagrammatic view of the repository maintenance functions. The user interacts with the system through these functions to define the data to be collected and merge the collected data into a central data Repository 1.M. The user interacts with three functions to perform repository maintenance.

The user, through a series of displays in 3.3 Define Repository process, defines the data to be collected and the grammars with semantics used to process the collected field data. The user, through a display in the 3.5 Configure Tour process, identifies what types of data is collected during his field data collection session. By identifying the types of data collected, the system applies the appropriate grammars and semantics to translate the data collected in the field into database records. The user also enters his name, organization and other relevant information.

The user, through a series of displays in the 3.1 Merge Repository Updates process, merges the data collected in the field into the central Repository 1.M. The assets, conditions, defects, and repairs are compared to the appropriate layer of historical data. Any discrepancies in the data are flagged and presented to the user for resolution. A discrepancy is identified when the new data is not consistent with the data already resident in the central Repository 1.M. After discrepancies in the data are resolved, the user approves the changes and the central Repository 1.M is updated.

The 3.6 Collect DGPS Data function continuously collects GPS reference data from a connectable Reference GPS Receiver and stores it in central Repository 1.M. This data is used to correct errors in the field collected GPS data. This correction can be performed post-processed or in real time.

The Repository 1.M data contains all the data for the system including all data stores discussed in earlier figures. This is data collected in the field, historical data, data used but not changed by the system, and reference data. Central Repository 1.M contains, as a minimum, the following historical data: Assets, Conditions, Defects, and Repairs. Central Repository 1.M contains, as a minimum, the following reference data: DGPS Data, Grammars, Semantics, and Imagery.

The Tour 1.F data store contains the information collected in the field and statistics about the field session. The information contained in the tour is at a minimum: the inspector, data, duration, type of inspection, and correctly formatted repository updates. The 3.2 Extract Field Data process provides the function of combining tour data with other historical data stores for processing and use by the user.

FIG. 5 shows an example of data collection in the field. FIG. 5 shows a vehicle V traveling north on Elm street. FIG. 5 shows the position of the vehicle V by its GPS points and shows two observation events indicated by the numerals 1 and 2. The data input from the observation events is voice data, indicated by the quotations in FIG. 5.

FIG. 6 shows the processing sequence for data conversion for the two specific observation events 1 and 2 identified in FIG. 5. FIG. 6 also shows the semantic analysis of associating observation event 2 to observation event 1. The results of the semantic analyses are indicated by the inclined block arrow in the lower part of FIG. 6.

FIG. 7 is the diagrammatic view of the four primary observations types. These four observations represent the possible data collected in the field and maintained in the Repository 1.M and are described in more detail immediately below.

Asset

Asset represents objects in the field that the user wishes to track and maintain. Examples of assets are: street signs, side walks, and curbs. Assets can be related to other assets. For example, a street sign that has one post and two signs attached can be represented as three assets that are associated together. Both Assets and Defects (below) have a spatial type (e.g., point, linear or polygonal). The spatial type and the associated layers information define how the asset information is correlated to other GIS layers during the automatic correlation processing shown in FIG. 3.

For example, street sign assets may be associated to a side GIS layer. This association defines that the location of the sign asset should be altered during processing to snap (or adjust) its position to be on the street edge, not in the street. Similarly, for defects, such as a concrete defect, (e.g., a crack), will be associated to the concrete network asset layer, which in turn is associated with the street edge layer.

Condition

Condition represents attributes of an asset that change over time and or position. The condition of the assets may be established in the system through grammar tables to allow the user to collect a predefined range and classes of conditions. For example, the conditions for street sign could be good, fair, and poor.

Defect

Defect represents a defined flaw in the asset that affect the health or goodness of the asset. Defects can also be set through grammars to reflect a type of defect or a severity.

Repair

Repair is the removal of a defect. As a repair is made the central data Repository 1.M can be updated to reflect the repair and the defect is then automatically removed from the database.

The diagrammatic view of FIG. 7 illustrates the four primary types of data maintained within central Repository 1.M of the system shown in FIG. 1 and also the possible relationships of the types of data. In FIG. 7 (as illustrated by the diagram box in the bottom left hand corner of FIG. 7) the arrows indicate the possible associations of the data structure relationships. Thus, as illustrated in FIG. 7, Assets can always be associated with other Assets, Condition must be associated with an Asset, Defect must be associated with an Asset, and Repair can be associated only with a Defect. FIG. 7 also shows the structure for each of the primary data types. The processing information portion of the structure of each primary observation type is embodied in the association (indicated by the arrows), the Spatial Type information, and the Storage Layer and Associated Layers information. Each of the primary observation types also has Location and Attributes in its structure.

As noted above in the Brief Description of the Drawing Views, FIG. 8 required too much illustration area to be capable of being shown on one sheet of drawings and was therefore composed of FIG. 8A (on one sheet of drawings) and FIG. 8B (on the succeeding sheet of drawings). Since it was necessary to show FIG. 8 on two sheets, the textual content of FIG. 8 is also set out below in this text for convenience in reference.

FIG. 8 is an example grammar of the type used in FIGS. 5 and 6 but for a specific asphalt distress observation type. Each of the boxes shown in FIG. 8 represent different sentence types. The two observation events illustrated in FIG. 5 correspond to the respective top box and bottom box in FIG. 8. The semantic information identifying that the second sentence is a modifier of the first sentence is illustrated by the two dash lines in FIG. 8: the first dashed line going from “Tag:blob” up to the term “blob” and the second dashed line going from “Tag:area” up to “area” in the Observation Template. The observation statements in FIG. 5 correspond to the Recognized Text 2.A in FIG. 2, and the Reference Observation Semantics 1.E of FIG. 2 correspond to the information contained in the asphalt project grammar of FIG. 8.

As noted above, FIG. 8 is an example grammar to the type used in FIGS. 5 and 6 but for a specific asphalt distress observation type. This example grammar illustrates one possible implementation of our method. There are two main sections illustrated in FIG. 8: the Observation Templates and the Sentence Templates. Each of the spoken sentences and the resulting Observation Templates are shown for the examples used in FIGS. 5 and 6.

In the first Observation Template, shrparea, the structure of the resulting observation is defined by the specification enclosed by the “{ }”. The “% s” identifies the type of GIS record to create. The “% t” identifies that time is to be included. The “% p” identifies that location is to be included. The “% e” identifies the several different slot values that are to be included (note the “:center” following the streetpos specification indicates that the value of center is a default). The “% m” identifies that there is a future modifying statement to include, and if not found, then “blob,sqft,50” is the default. The semantic relationship between the two separate verbal sentences is further illustrated by the dashed lines that indicate associations between templates, and between sentences and templates.

FIG. 8 further illustrates the semantic structure of the sentence templates. Each sentence, which corresponds to a set of possible verbal statements, is composed of slots. The information of how slot values are transferred to the observation record is defined by the ProType attribute of each slot.

For the first sentence “shrpdistressarea” each of the slots are copied into the resulting observation record based on slot tag. For the “areasqft” sentence the numeric values are combined to form a true number that is, by convention, assigned to the “area” slot, with tag “sqft,” and that is then copied into the “sqft % n” specification of the “blob” Observation Template. In this case the “% n” implies a numeric value required. The result of using this semantic information enables the two distinct verbal observations made in the examples of FIGS. 5 and 6 to be combined automatically into one resulting GIS record.

FIG. 9 illustrates graphically the data correlation process for the examples illustrated in FIGS. 5, 6, and 8.

While data collection is in progress, GPS data points are continuously collected, as well as the audio data and the other sensor data (see FIG. 2). The GPS data record contains the location as well as the time-stamp for that location.

When the system detects voice input by the user, a raw observation is created. This raw observation consists of the recorded voice and a time-stamp. Time is used as the synchronization key between all of the independent data streams: GPS, Audio, and Sensor.

The GPS data points are then compressed into a series of tracks (vectors and arcs) that represent the route taken by the user. Each of the track records consist of a start and stop position. An observation record's location information is determined using time and the GPS data to interpolate the location and the associated track and position along the track. The record consists of the observations text data and other sensor data, the track it was associated to, and the position along the track that the observation was made. These pieces of information are used to correlate the route taken and the observations made to the underlying network segments, which in this example are the street segments that were driven.

In the example shown, the user drives the city streets and makes observations about the condition of the streets. A typical point observation statement is “hole medium.” This observation is correlated to the street network, and a record is added to the Asphalt Distress Layer of the GIS. An example of a running observation is the combination “Segment Start”, “Surface Asphalt” and “Segment End”. These statements make a running observation which would be converted into a series of Asphalt Surface records for each street segment, and partial segment driven over between the “Segment Start” and “Segment End” statements.

Thus, as shown in FIG. 9 the collected GPS data is converted into the Track 1.F data. The Track 1.F data is correlated with the Street Network data. FIG. 9 also shows Defect data being loaded into its Asphalt Distress Layer. This Defect data from the Asphalt Distress Layer is then combined with the Street Network correlation results to create the association of the Defect with the Asset. The process from the GPS data layer to the track data layer (illustrated diagrammatically in FIG. 9) is also illustrated by the 1.2 Vectorize Location Data process in FIG. 2. The linkage from the track layer to the street network layer (illustrated in FIG. 9) is also illustrated by the 2.1 Correlate Tracks To Network process in FIG. 3. The input of the Defect data into the Asphalt Distress Layer (illustrated in FIG. 9) is also illustrated by the 1.6 Interpret Text process of FIG. 2. The linkage between the Asphalt Distress Layer and the Street Network Layer (illustrated in FIG. 9) is also illustrated by the 2.3 Correlate Observation To Network process in FIG. 3. FIG. 9 diagrammatically illustrates the example of FIG. 8 with respect to the two events noted on Elm Street as illustrated in FIG. 5.

While we have illustrated and described the preferred embodiments of our invention, it is to be understood that these are capable of variation and modification, and we therefore do not wish to be limited to the precise details set forth, but desire to avail ourselves of such changes and alterations as fall within the purview of the following claims. 

1. A method for automatically processing and managing spatial asset information in a repository, the method comprising: defining instances primary observation types and associations between each of the specific instances; identifying reference networks and geographic information system asset layers in the repository; configuring the repository based on said instances definitions and said associations; collecting field data; converting said collected field data to specific observations; correlating said specific observations to the appropriate said reference network and said geographic information system asset layers; and updating said appropriate geographic information system asset layers in the repository; wherein said collecting of field data step further comprises: capturing free speech stating verbal observations containing voice data; capturing location data contemporaneously with each of said verbal observations; time-stamping each of said captured verbal observations to create a raw verbal observation; and time-stamping said capture location.
 2. The method defined in claim 1 further comprising: configuring automatically analysis tools to exploit data in the repository so that said analysis tools are based on said instances definitions.
 3. The method defined in claim 1 further comprising: using a mobile unit computer in communication with a global positioning satellite receiver to capture said location data.
 4. The method defined in claim 1 further comprising: using a mobile unit computer in communication with a sound board for capturing said free speech stating verbal observations containing voice data.
 5. The method defined in claim 1 further comprising: constructing a predefined reference grammar for said voice data to be captured; incorporating a semantic relationship structure in said predefined reference grammar; and using said predefined reference grammar and said semantic relationship structure to enable said voice data in at least two of said raw verbal observations to be combined automatically with said captured location data into a single record in a geographic information system database.
 6. The method defined in claim 5 further comprising: using a mobile unit computer for capturing both said location data and said free speech stating verbal observations containing voice data; and transferring said captured location data and said captured voice data from said mobile unit computer to a processing computer, wherein said processing computer combines said captured voice data and said captured location data into said single record which is stored in said geographic information system database which is connectable to said processing computer.
 7. The method defined in claim 5 further comprising: configuring said predefined reference grammar and said semantic relationship structure to a specific form for a specific application.
 8. The method defined in claim 7 wherein said specific application is a street maintenance application and said predefined reference grammar and said semantic relationship structure are configured to said specific form for use in said street maintenance application.
 9. The method defined in claim 5 further comprising: time-stamping all records in said geographic information system database using only a central processing unit clock of a computer.
 10. The method defined in claim 9 further comprising: capturing a third stream of sensor data; time-stamping all said captured third stream of sensor data by said central processing unit clock of said computer; and combining automatically specific items of sensor data into specific records having specific voice data and specific location data by using said time-stamping as a synchronizing key.
 11. An apparatus for automatically processing and managing spatial asset information in a repository, the apparatus comprising: defining means for defining instances of primary observation types and associations between each of the specific instances; identifying means for identifying reference networks and geographic information system asset layers in the repository; configuring means for configuring the repository based on said instances definitions and said associations; data collection means for collecting field data; converting means for converting said collected field data to specific observations; correlating means for correlating said specific observations to the appropriate said reference network and said geographic information system asset layers; and updating means for updating said appropriate geographic information system asset layers in the repository; wherein said data collection means captures free speech stating verbal observations containing voice data and also captures location data contemporaneously With each of said verbal observations, and wherein the apparatus includes time-stamping means for time-stamping each of said captured verbal observations to create a raw verbal observation and for time-stamping said captured location data.
 12. The apparatus defined in claim 11 further comprising: configuring means for automatically configuring analysis tools to exploit data in the repository so that said analysis tools are based on said instances definitions.
 13. The apparatus defined in claim 11 wherein said data collection means includes a mobile unit computer in communication with a global positioning satellite receiver to capture said location data.
 14. The apparatus defined in claim 11 wherein said data collection means includes a mobile unit computer in communication with a sound board for capturing said free speech stating verbal observations containing voice data.
 15. The apparatus defined in claim 11 further comprising: reference grammar means for interpreting captured voice data contained in a plurality of verbal observations; said reference grammar means incorporating a semantic relationship structure means for combining said captured voice data contained in at least two of said plurality of verbal observations; and data conversion processing means for using said reference grammar means and said semantic relationship structure means to enable said captured voice data in said at least two of said plurality of verbal observations to be combined automatically with said captured location data into a single record in a geographic information system database.
 16. The apparatus defined in claim 15 wherein said data collection means includes a mobile unit computer for capturing both said location data and said free speech stating verbal observations containing voice data, and wherein the apparatus further comprises: a processing computer and a data transfer means for transferring said captured location data and said captured voice data from said mobile unit computer to said processing computer which combines said captured voice data and said captured location data into a single record which is stored in said geographic information system database connectable to said processing computer.
 17. The apparatus defined in claim 15 further comprising: a grammar configuring means for configuring said reference grammar means and said semantic relationship structure means to a specific form for a specific application.
 18. The apparatus defined in claim 17 wherein said specific application is a street maintenance application and said reference grammar means and said semantic relationship structure means are configured to said specific form for use in said street maintenance application.
 19. The apparatus defined in claim 15 wherein said time-stamping means time-stamps all records in said geographic information system database using only a central processing unit clock of a computer.
 20. The apparatus defined in claim 19 wherein said data collection means captures a third stream of sensor data, and wherein said time-stamping means time-stamps all captured third stream of sensor data by said central processing unit clock of said computer, and wherein said converting means automatically combines specific items of sensor data into specific records having specific voice data and specific location data by using said time-stamping as a synchronizing key.
 21. Computer-readable media tangibly embodying a program of instructions executable computer perform a method for automatically processing and managing spatial asset information in repository, the method comprising: defining instances of primary observation types and associations between each of the specific instances; identifying reference networks and geographic information system asset layers In the repository; configuring the repository based on said instances definitions and said associations; collecting field data; converting said collected field data to specific observations; correlating said specific observations to the appropriate said reference network and said geographic information system asset layers; updating said appropriate geographic Information system asset layers in the repository; wherein said collecting of field data step further comprises: capturing free speech stating verbal observations containing voice data; capturing location data contemporaneously with each of said verbal observations; time-stamping each of said captured verbal create a raw verbal observation; and time-stamping said captured location data.
 22. The computer-readable media defined in claim 21 further comprising: configuring automatically analysis tools to exploit data in the repository so that said analysis tools are based on said instances definitions.
 23. The computer-readable media defined in claim 21 wherein the computer is a mobile unit computer in communication with a global positioning satellite receiver to capture said location data.
 24. The computer-readable media defined in claim 21 wherein the computer is a mobile unit computer in communication with a sound board for capturing said free speech stating verbal observations containing voice data.
 25. The computer-readable media defined in claim 21 further comprising: constructing a predefined reference grammar for said voice data to be captured; incorporation a semantic relationship structure in said predefined reference grammar; and using said predefined reference grammar and said semantic relationship structure to enable said voice data in at least two of said raw verbal observations to be combined automatically with said captured location data into a single record in a geographic information system database.
 26. The computer-readable media defined in claim 25 wherein the computer is a mobile unit computer used for capturing both said location data and said free speech stating verbal observations containing voice data, and further comprising: transferring said captured location data and said captured voice data from said mobile unit computer to a processing computer, wherein said processing computer combines said captured voice data and said captured location data into said single record which is stored in said geographic information system database which is connectable to said processing computer.
 27. A computer programmed to execute process for automatically processing and managing spatial asset information in a repository, the process defining instances comprising: defining instances of primary observation types and associations between each of the specific instances; identifying reference networks and geographic information system asset layers in the repository; configuring the repository based on said instances definitions and said associations; collecting field data; converting said collected field data to specific observations; correlating said specific observations to the appropriate said reference network and said geographic information system asset layers; and updating said appropriate geographic information system asset layers in the repository; capturing free speech stating verbal observations containing voice data; capturing location data contemporaneously with each of said verbal observations; time-stamping each of said captured verbal observations to create a raw verbal observation; and time-stamping said captured location data.
 28. The computer as in claim 27 wherein the process further comprises: configuring automatically analysis tools to exploit data in the repository so that said analysis tools are based on said instances definitions.
 29. The computer as in claim 27 wherein the process further comprises: using a mobile unit computer in communication with a global positioning satellite receiver to capture said location data.
 30. The computer as in claim 27 wherein the process further comprises: using a mobile unit computer in communication with a sound board for capturing said free speech stating verbal observations containing voice data.
 31. The computer as in claim 27 wherein the process further comprises: constructing a predefined reference grammar for said voice data to be captured; incorporating a semantic relationship structure in said predefined reference grammar; and using said predefined reference grammar and said semantic relationship structure to enable said voice data in at least two of said raw verbal observations to be combined automatically with said captured location data into a single record in a geographic information system database.
 32. The computer as in claim 31 wherein the process further comprises: using a mobile unit computer for capturing both said location data and said free speech stating verbal observations containing voice data, and transferring said captured location data and said captured voice data from said mobile unit computer to a processing computer, wherein said processing computer combines said captured voice data and said captured location data into said single record which is stored in said geographic information system database which is connectable to said processing computer.
 33. An apparatus for automatically processing and managing spatial asset information, the apparatus comprising: a processing computer for receiving a plurality of field data that has been collected; and a data repository connectable to said processing computer for receiving processing results of said processing computer, wherein said data repository further comprises, a plurality of reference networks; a geographic information system having a plurality asset layers; a plurality of pre-defined instances of primary observation types; and a plurality pre-defined associations between each of said plurality of pre-defined instances of primary observation types, wherein said data repository is configured based upon said plurality of pre-defined instances of primary observation types and said plurality of pre-defined associations; wherein said processing computer, converts each of said plurality of field data into an appropriate one said primary observation types; correlates each of said converted primary observation types of each of said plurality of field data to an appropriate one of said plurality of reference networks and an appropriate one said plurality of asset layers; and updates said appropriate one of said plurality of asset layers with each of said converted primary observation types of each of said plurality of field data; and wherein said collecting of field data further comprises: capturing free speech stating verbal observations containing voice data; capturing location data contemporaneously with each of said verbal observations; time-stamping each of said captured verbal create a raw verbal observation; and time-stamping said captured location data.
 34. The apparatus defined in claim 33 further comprising: a mobile unit computer for collecting said plurality of field data; and a radio frequency transmitter connectable to said mobile unit computer for transferring said collected field data to said processing computer.
 35. The apparatus defined in claim 34 wherein said mobile unit computer further comprises: a microphone connectable to a sound board for capturing free speech stating verbal observations containing voice data; and a global positioning satellite receiver for capturing location data contemporaneously with each of said verbal observations, wherein each of said captured verbal observations and said contemporaneously captured location data is time-stamped to create a plurality of raw verbal observations.
 36. The apparatus defined in claim 35 wherein said data repository further comprises: a geographic information system database; a reference grammar for interpreting captured voice data contained in said plurality of raw verbal observations, wherein said reference grammar has a semantic relationship structure for combining said captured voice data contained in at least two of said plurality of raw verbal observations; and a data converter for using said reference grammar and said semantic relationship structure to enable said captured voice data in said at least two of said plurality of verbal observations to be combined automatically with said captured location data into a single record in said geographic information system database.
 37. The apparatus defined in claim 36 wherein said reference grammar and said semantic relationship structure are configured for a specific form for a specific application.
 38. The apparatus defined in claim 37 wherein said specific application is a street maintenance application and said reference grammar and said semantic relationship structure are configured to said specific form for use in said street maintenance application.
 39. The apparatus defined in claim 36 wherein said processing computer further comprises: a processing computer clock used for time-stamping all records in said geographic information system database.
 40. The apparatus defined in claim 39 wherein said mobile unit computer captures a third stream of sensor data, and further wherein said processing computer clock is used to time-stamp all captured third stream of sensor data, and further wherein said data converter automatically combines specific items of sensor data into specific records having specific voice data and specific location data by using said time-stamping as a synchronizing key.
 41. The apparatus defined in claim 33 wherein said data repository further comprises: a plurality of analysis tools automatically configured to exploit data in said data repository based on said plurality of pre-defined instances of primary observation types. 